# Quantized text generation
## Gemma 3 1b It Fast GGUF
A quantized version optimized for low-end hardware and CPU-only environments, aimed at production-ready inference under tight resource constraints.
Tags: Large Language Model
Author: h4shy (101 · 1)
## Qwen3 8B GGUF
License: MIT
A quantized text generation model (uploaded by ZeroWw) that keeps output and embedding tensors in f16 format, while the remaining tensors use q5_k or q6_k quantization, yielding a smaller file with performance comparable to pure f16.
Tags: Large Language Model, English
Author: ZeroWw (236 · 1)
## Qwen3 4B GGUF
License: MIT
A quantized text generation model with output and embedding tensors in f16 format, while other tensors use q5_k or q6_k quantization, resulting in a smaller size with performance comparable to the pure f16 version.
Tags: Large Language Model, English
Author: ZeroWw (495 · 2)
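The ZeroWw-style cards above describe keeping output and embedding tensors in f16 while quantizing the remaining tensors to q5_k or q6_k. A minimal back-of-envelope sketch of the resulting size trade-off, assuming illustrative parameter counts (not the real Qwen3 splits) and roughly 6.56 bits per weight for q6_k as used in llama.cpp:

```python
def gguf_size_gb(total_params: float, embed_out_params: float,
                 body_bits: float = 6.56, embed_bits: float = 16.0) -> float:
    """Rough file size in GB for a mixed-precision quantization:
    embedding/output tensors at embed_bits, all other tensors at body_bits.
    The 6.56 bits/weight default approximates q6_k; all figures are estimates."""
    body = (total_params - embed_out_params) * body_bits
    embed = embed_out_params * embed_bits
    return (body + embed) / 8 / 1e9  # bits -> bytes -> GB

# Illustrative only: 8e9 total params, 1e9 of them in embedding/output tensors.
mixed = gguf_size_gb(8e9, 1e9)                     # q6_k body, f16 embeddings
pure_f16 = gguf_size_gb(8e9, 1e9, body_bits=16.0)  # everything in f16
print(f"mixed ~{mixed:.2f} GB vs pure f16 ~{pure_f16:.2f} GB")
```

This illustrates why such mixed schemes land well under the pure-f16 size while leaving the quality-sensitive embedding and output tensors at full half precision.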